Boltzmann Games
نویسندگان
چکیده
A Boltzmann game is an n-player repeated game, in which Boltzmann machines are employed by players to choose their optimal strategy for each round of the game. Players only have knowledge about the machine they have selected and their own strategy set. Information about other the players and the game’s pay-off function are concealed from all players. Players therefore select their strategies independent of the choices made by their opponents. A player’s pay-off, on the other hand, will be affected by the choices made by other players playing the game. As an example of this game, we play a repeated zero-sum matrix game between two Boltzmann machines. We show that a saddle point will exist for this type of Boltzmann game.
منابع مشابه
Reinforcement Learning in Multi-agent Games
This article investigates the performance of independent reinforcement learners in multiagent games. Convergence to Nash equilibria and parameter settings for desired learning behavior are discussed for Q-learning, Frequency Maximum Q value (FMQ) learning and lenient Q-learning. FMQ and lenient Q-learning are shown to outperform regular Q-learning significantly in the context of coordination ga...
متن کاملDynamics of Boltzmann Q learning in two-player two-action games.
We consider the dynamics of Q learning in two-player two-action games with a Boltzmann exploration mechanism. For any nonzero exploration rate the dynamics is dissipative, which guarantees that agent strategies converge to rest points that are generally different from the game's Nash equlibria (NEs). We provide a comprehensive characterization of the rest point structure for different games and...
متن کاملDynamics of Softmax Q-Learning in Two-Player Two-Action Games
We consider the dynamics of Q–learning in two–player two–action games with Boltzmann exploration mechanism. For any non–zero exploration rate the dynamics is dissipative, which guarantees that agent strategies converge to rest points that are generally different from the game’s Nash Equlibria (NE). We provide a comprehensive characterization of the rest point structure for different games, and ...
متن کاملEvaluation of two lattice Boltzmann methods for fluid flow simulation in a stirred tank
In the present study, commonly used weakly compressible lattice Boltzmann method and Guo incompressible lattice Boltzmann method have been used to simulate fluid flow in a stirred tank. For this purpose a 3D Parallel code has been developed in the framework of the lattice Boltzmann method. This program has been used for simulation of flow at different geometries such as 2D channel fluid flow an...
متن کاملUsing the Lattice Boltzmann Method for the numerical study of non-fourier conduction with variable thermal conductivity
The lattice Boltzmann method (LBM) was used to analyze two-dimensional (2D) non-Fourier heat conduction with temperature-dependent thermal conductivity. To this end, the evolution of wave-like temperature distributions in a 2D plate was obtained. The temperature distributions along certain parts of the plate, which was subjected to heat generation and constant thermal conductivity condit...
متن کامل